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(54) Abstract Title 

Compression coding of video data 



(57) A DVD or other master compression operation is divided into a first step which produce a high bit rate 
primary coded bitstream. The maximum bit count for tlie DVD is then achieved In a second step of single or 
repeated transcoding of said primary bit-stream with no reference being made to the original video signal. 
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COMPRESSION ENCODING OF VIDEO 

This invention relates to MPEG and other compression encoding of 
video and is particularly concerned with non-reai-time encoding applications 
such as DVD authoring. 

A particular feature of DVO authoring is that whilst the maximunfi bit 
5 allocation is of course fixed by the storage capacity of the disc and the 

amount of ancillary material that the disc is also required to canry, the bit rate 
can. over the run length of the DVD, vary considerably with time. Advantage 
is taken of this feature to ensure that particularly critical material is encoded at 
high quality, at the expense of lowering the bit rate in less critical areas. 

1 0 Cunrent DVD authoring techniques generally involve a combined 

solution of a video tape recorder (VTR), a hardware MPEG2 encoder and an 
appropriate computer running control and authoring software and accessing 
bit-stream file storage. A typical process entails many sessions of playing the 
material from the VTR and encoding by the encoder, controlled precisely by 

1 5 the computer, with the resultant MPEG stream being captured by the 
computer. The approach is iterative, with each sut>sequent process of 
encoding the video being guided by the results of the previous attempts. The 
aim is achieve the desired quality throughout the material with the overall 
constraint of fitting the encoded MPEG within the target file size. Judging the 

20 desired quality in individual scenes or other sections of the video material is a 
subjective issue, often controlled- where the material is cinematographic film 
originating - by ttie film director, or by a quality control agent acting on his 
behalf. 

There are a number of disadvantages associated with current 
25 techniques. For example, the current metiiod requires full availability and 
> close Interaction of the VTR and the encoder throughout the authoring 

process. Also, the current metiiod has limitations on ttie level of control that 
can be applied to the encoding process, such as the precise application of 
encoding effort to small portions of the material. 
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It Is an object of the present invention to provide an improved 
compression coding process for use in DVD authoring or other applications 
where there exists a predetermined limit on the aggregate bit total. 

Accordingly, the present invention consists in one aspect in a process 
for compression coding video material to produce a coded bitstream wHh a 
predetermined limit on the aggregate bit total, comprising a first step of 
compression encoding to produce a primary coded bitstream having an 
aggregate bit total substantially in excess of said limit and a second step of 
single or repeated transcoding of said primary bit-stream to produce a 
secondary bitstream having a variable bit rate and an aggregate bit total 
within said predetenmlned limit. 

In a preferred form of the invention, a single reference encoding pass 
is performed at a generally fixed bit-rate. 

In an alternative, a single reference encoding pass is perfomied at a 
generally fixed quality level. 
5 This bit-rate Is high enough to ensure that even the most demanding 

sections of the material (in terms of the difficulty of compression coding) are 
coded to give a quality level sufficient to give visibly perfect pictures. All 
subsequent alterations of the instantaneous bit-rate are perfbnned by 
transcoding the original pass data, leaving coding modes unchanged. This 
10 transcoding Is performed in software. In addition to coding modes, all other 
coding decisions that remain appropriate in the transcoding process, are re- 
used from the primary encoding. 

This preferred fonri of the Invention offers a number of specific 

advantages. 

1 5 For example, the VTR and hardware MPEG encoder are only required 

for a single encoding session equal In time to the mntlme of the material 
being encoded. Also, the task of controlling the VTR and hardware MPEG 
encoder is minimised. 

There Is greater flexibility in the control of the coding quality, as it is 

20 totally under the control of the transcoding software on the computer. 
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This invention will now be described by way of examples. 
In the first step of a process according to one example of this 
invention, the source material in tape fomiat is Inserted into a VTR. and the 
start and end time-codes are set on a controlling computer. The computer 
5 instructs a hardware MPEG-2 encoder to process the material from the start 
time-code until the end time-code with a high quality level, sufficient for 
pictures that are visibly perfect. By "visibly perfecf is meant in practical tenns 
a picture quality that is everywhere at least equal to the highest picture quality 
that might foreseeably be demanded for any one picture or sequence in the 
10 transcoded, output bit-stream. The computer starts the VTR. and captures 
the primary encoded bitstream to disk. Once the whole material has been 
played the VTR is stopped, and the VTR and encoder can be released for 

other uses. 

The computer is then used to mn transcoding MPEG-2 software to re- 
15 quantise the bit-stream, to achieve the desired quality within the permissible 
file size. This can be a multi-pass and/or multi-part operation, with quality 
assessments at various points. 

In more detail, the first transcoding pass generates the variable bit-rate 
elementary stream (.vbs file). A parameter file, looking very similar to the SSG 
20 (Software Simulation Group: a publk; domain software encoder) .par file, is 
conveniently used to specHy:- 

Average bitrate 
Max. bitrate 
25 Min. bitrate 

Min. quantiser 

I, P & B relative quantisers 

etc. 

30 The Average bitrate is taken ftom the bit budget spreadsheet, the 

maximum bitrate is usually set to 8.8 Mbit/s and the minimum to around 1.25 
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MbH/s. Exceeding these limits usually crashes the DVD players. In a more 
complex system, the rate control algorithm could make use of additional 
information from the reference pass. 

The .vbs files are then subjected to a quality control (QC) process to 

5 check the video encoding quality. There will typically be a QC agent 

employed by the organisation responsible for the quality of the original picture 
material, and not by the organlsatfon conducting the DVD authoring. The QC 
agent will be expected to view critically the decoded and displayed bit-stream 
and may single step through Individual frames to pick-up problems. The QC 

1 0 agent will provide a set of notes indicating the nature of the fault and the 
timecode at which it occurred. 

The scenes identified by the QC check are reviewed and an attempt is 
made to re-code them. In one appiicatton. the user Is Invited to enter an IN 
point, an OUT point and a bitrate scaling, between 0.5 and 2.0, relative to the 

1 5 old .vbs. (It might be necessary to release bit-rate from one scene to be able 
to allocate mor« to another). The system then re-codes the segment and 
invites the user to accept or reject the results. If accepted, a further file, 
rather like an EDL is annotated to Indicate that the new elementary stream file 
should replace the first transcoded bitstream between the given timecodes. 

20 Once all of the necessary scenes have been re-coded, the individual 

files are combined to provide a single .vbs file for the whole segment, rather 
like conforming an edit. 

There are some important addittonal benefits of the transcoding 
approach according to the present invention. Because transcoding requires 

25 so much less processing than full re-encoding, it may be possible to do it 
faster than real time. So if the QC check identified a large number of 
changes, it may be nrwre convenient to transcode the whole bitstream again 
rather than just the portions marked for change. In any case, because the 
system requires fewer hardware resources than the prior art, it would be 

30 possible to make more attempts to arrive at an ideal balance of subjective 
quality within the desired bit budget. 
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Another advantage of the present invention stems from the much 
greater control of the quantiser in software transcoding as compared with a 
real hardware encoder. It would be possible, for example, to identify discrete 
areas of the picture that are of particular interest to the viewer or producer 
5 (such as the actor's face or a product positioned for marketing purposes) and 
to ensure that quantisation Is particularly fine for the identified picture area. 
Other parameters can also be tuned to ensure that particulariy critical areas 
of a picture can be encoded at the highest quality. 

By ensuring that the partial or complete re-encode process that is 

10 inherent in the transcoding process, uses so far as is practicable the same 
coding decisions as the primary encoding operation, cascade losses are 
avoided or minimised. For a discussion of this concept, reference is directed 
to WOg5/35628. WO98/03017. and PCT/GB99/00228. which are hereby 
Incorporated by reference. In these same references, is a discussion of a 

1 5 partial decode process that might be relevant in the present context 
In another embodiment, which shares some but not all of the 
advantages mentioned above, the reference encoding pass is replaced by an 
analysis of the video material to detemiine what coding decisions a hardware 
compression encoder would have taken in encoding the material. The first 

20 transcoding step is then replaced by a re-encoding process guided by those 
coding decisions. For a description of this type of pre-processing, reference 
is directed in particular to FCT7G699/00228. 

There would not be in this case be the same the saving In VTR use, 
but there would still be the possibility to save encoder resources and (using 

25 an HDTV VTR for example) the potential for faster-than-real-time operation. 



1 . A process for compression coding video material to produce a coded 
bitstream with a predetermined limit on the aggregate bit total, comprising 
a first step of compression encoding to produce a primary coded bitstream 
having an aggregate bit total substantially in excess of said limit and a 
second step of single or repeated transcoding of said primary bit-stream to 
produce a secondary bitstream having a variable bit rate and an 
aggregate bit total within said predetenmined limit. 

2. A process according to Claim 1 , wherein said first step is conducted on a 
dedicated hardware compression encoder and second step is conducted 
using compression transcoding software mnning on general purpose 
computing apparatus. 

3. A process according to Claim 1 or Claim 2, wherein the first step of 
compression encoding is perfonmed at a generally fixed bit rate. 

4. A process according to Claim 1 or Claim 2, wherein the first step of 
compression encoding is performed at a generally fixed quality level. 

5. A process according to any one of the preceding claims, wherein said first 
step Is so adapted that the picture quality at every point In the primary 
coded bit-stream is at least as high as the highest demanded picture 
quality for any one picture in the secondary bit-stream. 

6. A process according to any one of the preceding claims, wherein the 
compression encoding is based on MPEG . 

7. A process according to any one of the preceding claims, for DVD 
authoring. 
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